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Context. Understanding the formation and evolution of young star clusters requires quantitative statistical measures of their structure. 
Aims. We investigate the structures of observed and modelled star-forming clusters. By considering the different evolutionary classes in the 
observations and the temporal evolution in models of gravoturbulent fragmentation, we study the temporal evolution of the cluster structures. 
Methods. We apply different statistical methods, in particular the normalised mean correlation length and the minimum spanning tree 
technique. We refine the normalisation of the clustering parameters by defining the area using the normalised convex hull of the objects and 
investigate the effect of two-dimensional projection of three-dimensional clusters. We introduce a new measure ^ for the elongation of a cluster. 
It is defined as the ratio of the cluster radius determined by an enclosing circle to the cluster radius derived from the normalised convex hull. 
Results. The mean separation of young stars increases with the evolutionary class, reflecting the expansion of the cluster. The clustering 
parameters of the model clusters correspond in many cases well to those from observed ones, especially when the ^ values are similar. No 
correlation of the clustering parameters with the turbulent environment of the molecular cloud is found, indicating that possible influences 
of the environment on the clustering behaviour are quickly smoothed out by the stellar velocity dispersion. The temporal evolution of the 
clustering parameters shows that the star cluster builds up from several subclusters and evolves to a more centrally concentrated cluster, while 
the cluster expands slower than new stars are formed. 

Key words, stars: formation - stars: pre-main sequence - ISM: clouds - Open clusters and associations; general - Methods: statistical 
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1. Introduction 

Almost all stars form in clusters. Embedded clusters contain 
various types of young stars, making them ideally suited to 
study the early stages of star formation as they provide a large 
and genetically homogeneous sample (see Lada & Lada l200'^ 
for a review). Understanding the formation and evolution of 
young stellar clusters requires quantitative statistical measures 
of their structure, which may give important clues to the for- 
mation process. While some clusters are centrally concentrated 
with a smooth radial density gradient, others show filaments 
and signs of fractal subclustering. If and how different struc- 
tures are connected to the environmental conditions of the 
molecular clouds and how they depend on the evolutionary 
stage of the cluster is not yet clear. 

Different methods have been used to describe the cluster- 
ing properties of star clusters, e.g., the mean surface density 
of companions (Larson I1995> or spanning trees. Cartwright 
& Whitworth (2004; hereafter 'CW04 ) presented a review of 
various statistical methods for analysing the structures of star 
clusters and a detailed investigation of both observed and artifi- 
cially created clusters. We apply the methods discussed there to 
clusters created by numerical simulations of gravoturbulent star 
formation and investigate the clustering behaviour with time. 



We extend the analysis of observed clusters by considering dif- 
ferent evolutionary classes. 

Four classes of young stellar objects (YSOs) are distin- 
guished according to the properties of their spectral energy dis- 
tributions (SEDs) (e.g. Andre et al. .2000j : Class sources are 
deeply embedded protostars with a large sub-mm to bolomet- 
ric luminosity ratio (Lsmm/Lboi > 0.005). The Class stage is 
the main accretion phase and lasts only a few 10"* yr. Class 1 
objects are relatively evolved protostars, which are surrounded 
by an accretion disc and a circumstellar envelope. Pre-main- 
sequence stars in Class 2 and 3 correspond to classical and 
weak line T Tauri stars, respectively. They are characterised by 
a circumstellar disc (optically thick in Class 2, optically thin in 
Class 3) and the lack of a dense circumstellar envelope. These 
four classes are usually interpreted as an evolutionary sequence 
from Class to 3. The progenitors of these forming stars are 
prestellar cores (starless cores, prestellar condensations). These 
are gravitationally bound, dense molecular cloud cores with 
typical stellar masses that may already be in a state of collapse, 
but have not formed a central protostellar object yet. 

Sect.|2lexplains the statistical methods used. In Sect.|3land 
13 we describe the observations and the models, respectively, 
and in Sect.|5]the application of the methods to the data. The re- 
sults are analysed and discussed in Sect.|6l while we present our 
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Table 1. Clustering measures and numbers for the observed star-forming regions for prestellar cores (p). Class 1, 2, 3 sources, 
and all YSOs. (Note that the total number of YSOs is larger than the sum of objects in the individual classes, because it contains 
Class sources and objects with unclear classification not considered in the analysis of the individual classes.) For details see the 
discussion in Sect. 16. II 



Region 


Class 


n 


s['] 


s [pc] 


m['] 


m [pc] 


s 


m 


Q 


5* 


m' 






p Ophiuchi 


P 


98 


20.45 


0.83 


2.71 


0.11 


0.67 


0.55 


0.82 


0.49 


0.40 


0.81 


1.36 




1 


15 


11.82 


0.48 


3.15 


0.13 


1.13 


0.68 


0.60 


0.77 


0.43 


0.56 


1.47 




2 


111 


15.86 


0.65 


2.25 


0.09 


0.69 


0.58 


0.83 


0.54 


0.63 


1.15 


1.27 




3 


77 


16.25 


0.66 


2.53 


0.10 


0.82 


0.63 


0.76 


0.70 


0.66 


0.93 


1.17 




all 


205 


15.63 


0.64 


1.67 


0.07 


0.67 


0.57 


0.85 


0.53 


0.45 


0.85 


1.28 


Serpens 


1 


19 


3.43 


0.26 


1.12 


0.09 


0.99 


0.82 


0.83 


0.27 


0.22 


0.79 


3.61 




2 


18 


7.85 


0.59 


1.74 


0.13 


0.74 


0.61 


0.82 


0.52 


0.42 


0.80 


1.42 




all 


80 


5.98 


0.45 


1.06 


0.08 


0.60 


0.54 


0.90 


0.37 


0.32 


0.89 


1.63 


Taurus 


P 


52 


281.0 


11.47 


45.88 


1.87 


0.77 


0.48 


0.62 


0.59 


0.36 


0.61 


1.29 




1 


25 


265.6 


10.84 


73.18 


2.98 


0.83 


0.60 


0.73 


0.59 


0.41 


0.70 


1.39 




2 


108 


333.5 


13.62 


34.30 


1.40 


0.69 


0.41 


0.59 


0.43 


0.54 


1.26 


1.61 




3 


72 


414.4 


16.96 


52.99 


2.16 


0.84 


0.51 


0.60 


0.77 


0.63 


0.82 


1.09 




all 


197 


356.8 


14.58 


25.98 


1.06 


0.67 


0.39 


0.58 


0.46 


0.27 


0.58 


1.45 


IC 348 


all 


288 


6.86 


0.63 


0.66 


0.06 


0.56 


0.52 


0.93 


0.49 


0.45 


0.92 


1.16 


Cfiamaeleon I 


all 


180 


41.51 


1.81 


3.74 


0.16 


0.55 


0.37 


0.68 


0.50 


0.34 


0.68 


1.32 



conclusions in Sect. The Appendices give the details about 
the normalisation of the clustering parameters. 

2. Statistical Methods 

2.1. Normalised correlation length 

A wide range of statistical methods has been developed to anal- 
yse the structure of star clusters (see 'CW04 for a review). A 
simple approach is to study the distribution of source separa- 
tions, as it has been done e.g. by Kaas et al. J2004> for the 
Serpens cloud core. Larson jl995t . extending the analysis by 
Gomez et al. (J993,) . introduced the mean surface density of 
companions 2(0), a tool that since then has often been used 
to study star-forming clusters (e.g. Bate et al. 1998 Gladwin 
et al. 1199 9, Klessen & Kroupa 2001j . The mean surface den- 
sity of companions (MSDC) specifies the average number of 
neighbours per square degree on the sky at an angular separa- 
tion 9 for each cluster star. Knowing the distance to the cluster, 
can be converted to an absolute distance r to determine 2(r). 
The physical interpretation of the MSDC can be difficult, and 
ICW04l have shown that the normalised correlation length is a 
better indicator for the clustering behaviour The normalised 
correlation length s is the mean separation s between stars in 
the cluster, normalised by dividing by the radius of the cluster, 
^cluster- This radius is defined via the normalised convex hull of 
the objects (see AppendixlAl. The s values are independent of 
the number of stars in the cluster (ICW04t . 

2.2. Minimum spanning trees 

The minimum spanning tree (MST), a construct from graph 
theory, is the unique set of straight lines ("edges") connecting 
a given set of points ("vortices") without closed loops, such that 
the sum of the edge lengths is a minimum (Kruskal .1956. Prim 



119571 Gower & Ross '1969V In astrophysics, minimum span- 
ning trees have so far mainly been used to analyse the structure 
of galaxy clusters (e.g. Barrow et al. [I985 Adami & Mazure 
Doroshkevich et al.'2004». From the MST the normalised 
mean edge length m is derived. Unlike the mean separation 
length s, the mean edge length m depends on the number of 
stars in the cluster, therefore it has to be normalised by the fac- 
tor (A/n)'^^ (Marcelpoil 1993 1, where n is the total number of 
stars and A the two-dimensional area of the cluster In the three- 
dimensional case the normalisation factor is (V/n)'^^, where V 
is the volume of the cluster Area and volume are defined by the 
normalised convex hull of the objects (see Appendix IaIi. The 
normalisation factors are discussed in detail in AppendixiBl 

An additional reducing operation, called separating, can 
be used to isolate subclusters (Barrow et al. ll985> . Separating 
means removing all edges of the MST whose lengths exceed 
a certain limit. When removing edges from a MST, each re- 
maining subgraph is again a MST of its vortices (Robins et al. 
120001 . 

2.3. The combined measure Q 

The values s and m, on their own, can quantify, but cannot dis- 
tinguish between, a smooth large-scale radial density gradient 
and multiscale fractal subclustering. Dussert et al. ( I1986> com- 
bine the mean edge length m of a MST and its standard devia- 
tion (Tm and use the (m, crin)-plane to separate different degrees 
of order in various systems. However, CWOJ show that this 
is not sufficient to differentiate between a smooth large-scale 
radial density gradient and fractal subclustering. Therefore, 
CW04 introduced the parameter Q = m/s, which can provide 
this distinction. Large Q values (Q > 0.8) indicate centrally 
concentrated clusters having a volume density n oc r"", where 
Q increases with increasing a (i.e. with increasing degree of 
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central concentration). Small Q values (Q < 0.8) describe clus- 
ters with fractal substructure, where Q decreases with increas- 
ing degree of subclustering. 

2.4. The elongation of a cluster 

We also investigate the elongation of the clusters. We define 
the elongation ^ of a cluster as the ratio of the cluster radius 
defined by an enclosing circle to the cluster radius derived from 
the normalised convex hull of the objects. A value of ^ a; 1 
describes a spherical cluster, while a value of ^ « 3 corresponds 
to an elongated elliptical cluster with an axis ratio of a/b x 10. 
See Appendix Iclfor details. 

3. Observations 

Our observational data are based on the sample of YSOs in em- 
bedded clusters compiled from various sources as discussed in 
detail by Schmeja et al. (2005; here after IS KF05 h . However, in 
the current analysis only the regions p Ophiuchi, Taurus and 
Serpens will be studied in detail. These are clusters, where suf- 
ficient information on the evolutionary classes as well as on the 
positions is given in the literature. IC 348 and Chamaeleon I are 
used for determining additional clustering parameters. The data 
of p Ophiuchi are taken from Bontemps et al. f2001^ (YSOs) 
and Stanke et al. (2005 1 (prestellar cores), the Serpens data are 
from Kaas et al. (2004) (Class 1/2), Hurt & Barsony (1996 1, 
and Froebrich (2005 1 (Class 0), the Taurus data are taken from 
Hartmann (2002 1 (YSOs) and Lee & Myers (1999 1 (prestellar 
cores). The data of IC 348 are taken from Luhman et al. (2003]| 
and those of Chamaeleon I from Cambresy et al. (1998). For 
further details on the compilation of the original sample see 
ISKF05I The numbers of objects are given in Table[2 the posi- 
tions of all YSOs are plotted in the upper panel of Fig. ^ The 
adopted distances used to determine the absolute values of s 
and in are 140 pc for p Oph (Bontemps et al. 1200 It . 260 pc for 
Serpens (Kaas et al. 2004 1, 140 pc for Taurus (Hartmann'2002'), 
315 pc for IC 348 (L uhman et al. l2003l . and 150 pc for Cha I 
(Haikala et al. l2005l . 

4. The models 

We perform numerical simulations of the fragmentation and 
collapse of turbulent, self-gravitating gas clouds and the re- 
sulting formation and evolution of protostars as described in 
Schmeja & Klessen (2004; hereafter SK04). We use a code 
based on smoothed particle hydrodynamics (SPH; Monaghan 
I1992> in order to resolve large density contrasts and to follow 
the evolution over a long timescale. The code includes periodic 
boundary conditions (Klessen ll997> and sink particles (Bate et 
al.[r995 1 that replace high-density cores while keeping track of 
mass and linear and angular momentum. We determine the res- 
olution limit of our SPH calculations using the Bate & Burkert 
J1997> criterion. This is suflicient for the highly nonlinear den- 
sity fluctuations created by supersonic turbulence as confirmed 
by convergence studies with up to 10^ SPH particles (Jappsen 
et al. l2005l Li etal. 120051 . 



Our simulations consist of two globally unstable models 
that contract from Gaussian initial conditions without turbu- 
lence and of 22 models where turbulence is maintained with 
constant rms Mach numbers Ai, in the range 0.1 < M < 
10. We distinguish between turbulence that carries its energy 
mostly on large scales, at wavenumbers 1 < A; < 2, on interme- 
diate scales, i.e. 3 < A; < 4, and on small scales with 1 <k <d>. 
The naming of the models, Gl and G2 for the Gaussian runs, 
and M.M]di (with rms Mach number M and wavenumber k) 
for the turbulent models, follows SK04 Details of the individ- 
ual models are given in their Table 1 . 

The dynamical behaviour of isothermal self-gravitating gas 
is scale free and depends only on the ratio a between internal 
energy and potential energy: a - Ziint/I^potl- Since we are only 
interested in the positions of the young stars, and since all the 
clustering parameters are normalised by the cluster radius, the 
physical scaling is irrelevant for the present study. 

The YSO classes are determined as follows (see ISKF05I 
and Froebrich et al. 2005 for a detailed discussion): The be- 
ginning of Class is identified with the formation of the first 
hydrostatic core, when the central object has a mass of about 
0.01 Mq (Larson l2003l . The transition from Class to Class I 
is reached when the envelope mass is equal to the mass of the 
central protostar (Andre et al. l2000> . We determine the transi- 
tion from Class 1 to Class 2 when the optical depth of the re- 
maining envelope becomes unity at 2.2 yum (K-bwd). The end 
of Class and Class 1 corresponds to a mass of about 0.43 and 
0.85 times the final mass, respectively ( SKF05 1. Lacking a fea- 
sible criterion to distinguish Class 2 from Class 3 objects, we 
consider both classes combined. Prestellar cores are identified 
by a clump-finding algorithm described in Klessen & Burkert 
(2000, see also SKF05l. 

We only consider models with a numerical resolution of at 
least 200000 particles. Furthermore, in order to get reasonable 
numbers of protostars for the statistics, we select only those 
models where more than 37 protostars with Mend > 0.1 Mq 
(roughly corresponds to the detection limits of the observa- 
tions) are formed. This reduces our set of models to 16. Again, 
see Table 1 of iSK04 for further details. 

5. Application to the data 

We construct the MST following Prim's J1957> algorithm, as 
also described by Gower & Ross (1969 1. For both the observa- 
tions and the models, the parameters s, ill, Q, and ^ are com- 
puted for the different evolutionary classes independently (pro- 
vided, there is a sufficiently large number of YSOs of that class) 
as well as for the entire cluster. In no region is the number of 
Class sources large enough to be included. In addition, these 
parameters are determined at frequent timesteps of the simu- 
lations to obtain the temporal evolution of the parameters. In 
the case of m, this requires to construct the MST anew at ev- 
ery chosen timestep. As an example, Fig.|2]shows the MST of 
one model at dififerent evolutionary stages. While the parame- 
ters s, in and Q are calculated using the normalisation factor 
given above, s*, m* and Q* are determined with the normalisa- 
tion factor and cluster radius of .CW04. (see also the discussion 
in the Appendix). 
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Fig. 1. Upper panel: Observational data for the star-forming clusters p Ophiuchi, Serpens, and Taurus. Circles: Class 0, triangles: 
Class 1, diamonds: Class 2, squares: Class 3, asterisks: prestellar cores. The sources of the observational data are given in the 
text. Other panels: The MSTs of the same regions for all YSOs (second row), prestellar cores (third row), and Class 1 , 2, 3 objects 
(fourth, fifth, and sixth row). For Serpens, no data on prestellar cores and Class 3 objects are available. 
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SFE: 10% SFE: 20% SFE: 30% SFE: 40% 




Fig. 2. The 2D minimum spanning tree for the YSOs of model M6k4a projected into the xy -plane at a star formation efficiency 
of 10%, 20%, 30% and 40% (from left to right). 



As shown by'CW04' the effect of binary stars on the clus- 
tering parameters is not negligible. Since binaries create very 
short edge lengths, a large fraction of binaries will significantly 
reduce the mean edge length m (and as a consequence also 
change Q). As the binaries are not part of the clustering regime, 
their influence on the clustering parameters has to be mini- 
mized. While for most of the clusters it is not relevant, Taurus is 
known to have a large binary population. Thus we removed the 
known binary companions from our sample of YSOs in Taurus. 

We calculate the clustering parameters s, m, Q, and ^ for 
the model cluster in three dimensions, and, in order to compare 
them with the apparent two-dimensional observational data, 
projected into the xy-, xz-, and yz-plane. The positions of the 
stars are coiTected for the periodic boundary conditions. Only 
objects inside a box of ten times the side length of the original 
computational box are considered, if there are any objects out- 
side this volume, we assume that they have left the cluster and 
are not relevant for the clustering process. 

In the beginning stages of the cluster as a whole, and of 
the individual evolutionary classes, the number of objects is 
rather low. In such cases the analysis may not be statistically 
significant and has to be taken with a grain of salt. 

6. Discussion 

6.1. Observations 

Table[niists the clustering measures introduced in Sect. |2lfor 
the observations. Columns 4 to 7 list the (non-normalised) 
mean separation of objects and mean MST edge lengths in arc- 
minutes and parsec. Columns 8 to 10 give the parameters 5, m 
and Q, while Columns 1 1 to 13 list the same values calculated 
using the normalisation of CW04l(s*, m*, Q*). The last column 
gives the elongation ^. The values of s*, m* and Q* agree with 
the values given bv lCW04l except for s* and m* of Taurus (in- 
terestingly, however, our Q* is the same as theirs). We attribute 
small differences to the slightly different underlying samples, 
and the discrepancy for Taurus to the different treatment of bi- 
naries. Due to the different definition of the radius/area of the 
cluster, s* and ill* differ significantly from s and ill, while Q 
and Q* are roughly the same, in particular for large samples 
(see the discussion in Appendix iBt. Taurus and Chamaeleon I 



have substructure, while p Oph and IC 348 are centrally con- 
centrated clusters (see the discussion in lCW04t . Serpens (not 
discussed by CW04 ) has Q - 0.90, corresponding to a central 
concentration with a radial density exponent a « 1.9, similar 
to IC 348. 

The linear distances s and m are in the same range 
for p Oph, Serpens, and IC 348, significantly larger for 
Chamaeleon I, and about an order of magnitude larger in the 
case of Taurus, confirming the notion that Taurus represents 
a somewhat less clustered mode of star formation. However, 
when considering only the central part of the Taurus region, the 
values decrease significantly to i = 3.05 pc and m = 0.49 pc. 
The latter is comparable to the 0.3 pc estimated as the average 
distance to the nearest stellar neighbour in the central region of 
Taurus by Gomez et al. ( 1993 1 and Hartmann (2002). 

The mean separation s increases with the evolutionary class 
for all three clusters investigated in this regard, reflecting the 
expansion of the cluster, which can also be seen in Fig. ^ 
While Class protostars are formed in the high-density cen- 
tral regions, more evolved YSOs akeady had time to move to 
more remote regions (see also Kaas et al. .2004j . Note that the 
prestellar cores do not fit into this sequence, as they are dis- 
tributed over an area roughly as large as the entire cluster. Thus 
we speculate that not all objects identified as prestellar cores 
will eventually form stars. Only in the central parts of the clus- 
ter may the density be high enough to make the cores collapse. 
This is consistent with the findings of Vazquez-Semadeni et 
al. J2005> that a significant number of "failed cores" should 
exist, which may redisperse and which may correspond to the 
observed starless cores. 

The elongation of the clusters ranges from ^ = 1.16 for 
the almost perfectly spherical cluster IC 348 to ^ = 1.63 for 
Serpens. The ^ value can differ significantly for subclusters, 
e.g. the filamentary central part of the Taurus region has an 
elongation of ^ = 1.83. 

6.2. Models 

We compute the clustering parameters s, in, Q, and ^ for all 
models and compare them with each other as well as with those 
from the observations. Figure |3l shows the clustering parame- 
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Fig. 3. The clustering parameters s, m, Q, and ^ for all models at a SFE of 10% (upper panel) and 40% (lower panel), plotted 
versus the Mach number Shown are the values for the projection into the xy-, xz-, and yz-plane (diamonds, triangles and squares, 
respectively), and for the 3D analysis (filled circles). The horizontal lines show the corresponding values from the observational 
data for Serpens, Taurus, Chamaeleon I, IC 348, and p Oph (from left to right). 



ters for all models, sorted by the Mach number at a star for- 
mation efficiency (SFE) of 10% and 40%. The clustering pa- 
rameters of the observed clusters (from Table [0 are shown as 
horizontal lines at their Mach number (taken from SKF05). The 
s and ill values of the models are in general significantly larger 
than those from the observations. A particularly large discrep- 
ancy is noted when the model cluster is strongly elongated. A 
large f value reduces the normalisation factor and increases s 
and m. While the observed clusters show rather moderate elon- 
gations in the range 1.1 < ^ < 1.6, in some models the stars 
form in a single filament with an elongation ^ > 3. Model clus- 
ters with elongations in the range of the observations show also 
a good agreement in the other parameters. For example, the 
fairly spherical cluster of model M6k4a (shown in Fig.|2} has 
(at 40% SFE and projected into the xy-plane) an elongation of 
^ = 1.28 and the clustering parameters s - 0.72, ill = 0.64, 
and Q = 0.89. These values are almost identical to those of 
p Ophiuchi (^ = 1.28, s = 0.70, m = 0.59, Q = 0.85). For 
the other projections the values differ by less than 8%. Most of 
the Q values, which are independent of the area, lie in the same 
range as those from the observations. We find no correlation 
of the clustering parameters with the properties of the turbu- 
lent driving (Mach number or wave number k) of the models. 
Neither do the values from the observations show any correla- 
tion with the Mach number Thus we conclude that if there is 
any systematic influence of the turbulent environment on the 
clustering behaviour, it is only existent in the earliest phase of 
cluster formation, before it is smoothed out by the motions of 
the individual protostars (see also Bate et al. , 1998. 1. 

Fig Eshows the MST of the model cluster M6k4a at dififer- 
ent evolutionary stages and reveals the expansion of the cluster. 
We analyse the temporal evolution of the clustering parame- 
ters: 5(f), m(t), Qit), ^(t) in all models. As an example, Fig.0] 
shows this sequence for the said model M6k4a. The general 



behaviour is the same for all models: s and m decline slightly 
with time, while Q increases slowly or stays at a roughly con- 
stant value. This evolution is shown by the entire cluster as 
well as by the individual classes, although in later stages the 
latter values might fall to zero as the number of objects in a 
particular class becomes zero. Decreasing s and ili values in- 
dicate that star formation sets in in diff'erent, rather dispersed 
regions of the cloud. The cluster becomes denser as more and 
more gas is turned into protostars. New stars are formed faster 
than the cluster expands. According to the Q values, the cluster 
evolves slowly from fractal subclustering to a more centrally 
concentrated cluster although in no model does Q2D rise sig- 
nificantly above the 'divding value' of 0.8. This again shows 
that the cluster builds up from separate groups which will grow 
into a single, more centrally concentrated cluster, as also found 
by Bonnell et al. (l2003t and Clark et al. (l2005l . 

6.3. The effect of projection 

Looking at the two-dimensional projections of the 3D model 
clusters does not significantly change the picture as a whole. 
The individual 5, m and Q values can indeed differ for the 
projection into the xy, xz, and yz plane, but the qualitative be- 
haviour of the evolution is more or less the same, independent 
of the projection (Fig. |5}- While the s^u and mj,o values usu- 
ally are higher than the values of the projections, Q^u tends to 
be lower than the values of the projections. The investigation 
of several hundred randomly created clusters shows that s^u 
is always expected to be larger than the value for the projec- 
tions (sio/^iD ~ 1-2), while the m values can be in the range 
0.95 < m^u/tniD ^1-4 with a mean value of 1.1. In the ex- 
treme case, Q2U can differ by up to 30% from the 2D value. 
Note that the physical interpretation of Q as given bv .CW04. is 
based on the two-dimensional analysis. Therefore, for an inter- 
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Fig. 4. The temporal evolution of the 3D clustering parameters s, m, Q, and ^ for one model (M6k4a), dotted: Class 0, dashed: 
Class 1, dash-dotted: Class 2/3, solid line: entire cluster. 
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Fig. 5. The temporal evolution of the clustering parameters s, in, Q, and ^ of the model M6k4a for the 3D analysis (solid line) 
and for the projection into the xy (dotted), xz (dashed), and yz plane (dash-dotted). 
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Fig. 6. The MST of the Taurus cluster as a whole (left panel) and separated at I + 2cr^, I + cTm, and I + Q.5cr^ (from left to right). 



pretation of the numerical values (and not only the trend) of Q, 
the projected values have to be used. 

The elongation measure ^, on the other hand, depends 
strongly on the projection. Obviously, an elongated, filament- 
like structure seen from the side will look spherical when ob- 
served along its major axis. In the case of the models the three- 
dimensional value of ^ can be used to describe the true shape 
of the cluster. A large scatter in the ^20 values means a high ^30 
value and vice versa. 

6.4. The effect of separating 

We separate the observed clusters by succesively removing all 
MST edges with lengths I larger than I + 3cr^, I + 2cr^, I + cr^, 
and / + Q.ScTra, where (Xm denotes the standard deviation of the 



mean edge length of the MST. This process is demonstrated 
for the Taurus star-forming region in Fig.|6l When we compare 
all three regions with sufficient data, we see that for Serpens 
and Taurus only solitary stars in the outskirts of the cluster are 
removed, while the central part of the cluster stays connected. 
However, the more homogeneous p Oph cluster breaks down in 
two roughly equally large clusters at the last step. In Fig. we 
show the clustering parameters of the largest remaining sub- 
cluster (i.e., the one containing the largest number of stars after 
each level of separation). The parameter Q is less affected by 
the separating procedure than s and in. Significant changes are 
only seen when large parts of the cluster are excluded (as in the 
step from 1 to 0.5cr for p Oph, when the cluster breaks up into 
two large subclusters). The elongation measure ^ varies signifi- 
cantly with the level of separation. For example, the remaining 
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Fig. 7. The clustering parameters s (diamonds), m (triangles), Q (squares), and ^ (asterisks; scale on the right-hand axis) of the 
largest remaining subcluster at different levels of separation. The numbers along the abscissa give the number of YSOs contained 
in the particular subcluster. 



subcluster in Taurus shows a filamentary structure and thus a 
larger^ (^ = 1.83) than the cluster as a whole. 

7. Summary and Conclusions 

We show that the normalised mean correlation length s and the 
mean edge length of the minimum spanning tree m, and in par- 
ticular the combination of both parameters, Q, as proposed by 
ICW04I are very useful tools to study the structures of star clus- 
ters, both observed ones and those from numerical simulations. 
We refine the definiton of the cluster area by using the nor- 
malised convex hull rather than a circular or rectangular area 
around the objects. Unlike s and m the parameter Q is indepen- 
dent of the definiton of the area of the cluster. In addition, it is 
less affected when removing stars from the cluster by separat- 
ing the MST. 

We introduce a new measure ^ for the elongation of a clus- 
ter. It is defined as the ratio of the cluster radius determined 
by an enclosing circle to the cluster radius derived from the 
normalised convex hull. This is a stable statistical measure not 
influenced by fractal substructre, which could also be applied 
to the filamentary structure of molecular clouds. 

The mean separation s increases with the evolutionary 
class, reflecting the expansion of the cluster The prestellar 
cores do not follow that sequence, leading us to the speculation 
that not all objects classified as prestellar cores will eventually 
form stars. The clustering values of the models lie roughly in 
the same range as those from observed clusters. A particularly 
good agreement is reached when the clusters have similar elon- 



gation values ^. No correlation with the Mach number or the 
wave number of the turbulence is found. We conclude that pos- 
sible influences of the turbulent environment on the clustering 
behaviour are quickly smoothed out by the velocity dispersion 
of the young stars. 

The temporal evolution of the clustering parameters shows 
that the star cluster builds up from several subclusters and 
evolves to a more centrally concentrated state. New stars are 
formed faster than the cluster expands. The projection of the 
3D models into a 2D plane changes the clustering parameters, 
but not the general behaviour with time. 
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Fig.A.l. Randomly distributed data points and the area they 
span according to different definitions (circle, rectangle, convex 
hull). 

Appendix A: Radius and area of a cluster 

For the normalisation of the parameters s and m the radius and 
the area (or volume) of the cluster are needed (see AppendixiBl. 
Since a star cluster has no well-defined natural boundary, dif- 
ferent approaches to determine the cluster area have been used. 
ICW04I define the cluster radius as the distance between the 
mean position of all cluster members and the most distant star 
and the area as a circle with the cluster radius: A = nR^, , . 

cluster 

Adami & Mazure ( 1999 1 define the area used for the normalisa- 
tion of the MST edge lengths as the maximum rectangle of the 
point set. However, both methods tend to significantly overes- 
timate the area of the cluster, in particular if the cluster is elon- 
gated or irregularly shaped rather than spherical. Therefore, we 
estimate the area A of the cluster using the convex hull of the 
data points, normalised by an additional geometrical factor tak- 
ing into account the ratio of the number of objects inside and 
on the convex hull: 

A = m(//)/[l - (vh/«)] (A.l) 

(Hoffman & Jain [T983l Ripley & Rasson [T977l . where H is the 
convex hull, m{H) its area, vh the number of vertices on H and 
n the number of objects. The correction factor is used since the 
convex hull by itself tends to be smaller than the true sampling 
window. To be consistent, we define the cluster radius as the 
radius of a circle with the same area A. In the three-dimensional 
case the volume V and the radius are defined analogously. The 
convex hull is computed using the programme Qhull (Barber et 
al. [T996t . 

Fig. lA. ll demonstrates that the definition of the cluster area 
is crucial, since it can differ by a factor of two or more. In 
the given example of randomly distributed data points, the area 
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of the circle is 46.33, that of the rectangle is 30.40, the area 
of the convex hull is 24.19, and the normalised area (Eg. lA.H 
is 24.83. For the calculation of the parameter Q = m/s the 
size of the radius/area is irrelevant (as long as the area used 
for the normalisation of m depends on the radius used for the 
normalisation of s or vice versa). The radius is cancelled since 
it is used to normalise both, m and s. 

Appendix B: Normalisation of the IVIinimum 
Spanning Tree 

The edge lengths of a minimum spanning tree depend on the 
number of points and on the area, therefore the mean edge 
length m has to be normalised, in order to compare the results 
from samples with different numbers of objects and/or differ- 
ent areas. Beardwood et al. ( 1959) show that the length of the 
shortest closed path through n points l(P") in a plane region 
of area A is asymptotically proportional to yfnA for large n. 
Since the number of edges in a MST is (n - 1), Hoffman & Jain 
J1983> andlCW04 claim that the expected length of a randomly 
selected edge of a MST is asymptotically proportional to 



«- r 



(B.l) 



They use this factor to normaUse the mean edge lengths of dif- 
ferent data sets. Beardwood et al. (1959 1, however, deal with 
the closed path through n points (i.e., the travelling salesman 
problem), not a MST, as cited incorrectly by Hoffman & Jain 
( fT^O and|CW04 Marcelpoil STW^ . on the other hand, de- 
duces the normalisation factor 

^JAJ7^ (B.2) 

for comparing the mean edge lengths of different samples. 
Since 

VM/(n-l) n , 

{a.i) 



lim 



lim 



• n - I 



1, 



both methods lead to the same result for large n, e.g. for 
n > 100 the difference is < 1% (see Fig. IB.U . In this work 
we use the normalisation factor of Marcelpoil (I1993I I. which 
seems more plausible to us. 

In the three-dimensional case the general result for k di- 
mensions of Beardwood et al. J1959> (their Equation 2) can be 
written as l(P") oc VVn^, leading to the factor 



(B.4) 



«- 1 

to normalise the mean edge lengths of the 3D MST. However, 
we follow the reasoning of Marcelpoil (I1993> and use the nor- 
malisation factor 

(B.5) 

in the 3D case. The volume V is the volume of the convex 
hull as discussed above. Again, the ratio of both normalisation 
factors asymptotically approaches 1, making them interchange- 
able for large n. 

Note that we use different definitions of the radius and the 
area from .C W04 . (Appendix IXt. resulting in an additional dif- 
ference between our and their parameters m and s. However, in 
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Fig. B.l. Mean edge lengths m for 200 randomly created 2D 
sets of points (3 < « < 400), normalised with the factor 15^2] 
(crosses) and IB. II (diamonds'), respectively, plotted versus the 
number of points. 
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Fig. C.l. The relation between the axis ratio a/b of an elliptical 
area and the elongation ^, with the Icr error also indicated. 



the calculation of Q the radius contained in the normalisation 
factor is cancelled out, so the difference in the Q parameters 
boils down to Eauation lB.3l Unlike m and 5, the more relevant 
parameter Q is the same for a large number of objects, indepen- 
dent of the normalisation method. To allow comparison, we list 
the parameters computed using the normalisation factor and the 
cluster radius according to lCW04l in Table^as well. 

Appendix C: Elongation of a cluster 

We notice that some of the model clusters are strongly elon- 
gated, causing a large difference in the cluster area depending 
on whether it is defined by the enclosing circle or by the nor- 
malised convex hull. We use this fact to propose a new, statis- 
tically stable characterisation of the elongation of a cluster. We 
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define the elongation measure ^ as the ratio 




(C.l) 



R' 



(conv. hull 
cluster 



This is an approach similar to the "filament index" T intro- 
duced by Adams & Wiseman J 19941 . However, while Adams 
& Wiseman (19971 use the actual area of a cloud obtained from 
column density maps, we have to estimate the area of the clus- 
ter of point sources by the convex hull. This makes ^ a reliable 
measure for the elongation, since it excludes the possibility that 
fractal substructure in an otherwise spherical cluster leads to a 
large ratio of the two radii. The measure ^ could also be used 
to quantify the filamentary structure in molecular clouds. 

In order to test if ^ is indeed a good measure for the elon- 
gation we place 350 points randomly on an elliptical area with 
increasing axis ratio (1 < a/b < 10). To minimise the statistical 
scatter we perform 500 different realisations for each a/b and 
determine the mean ^ and its standard deviation for each step. 
Figure RTTI shows that ^ increases with the axis ratio a/b. For 
a/b - 1 (i.e., a circle), f 1, as expected. 



